DETAILED ACTION

1. This communication is responsive to the Amendment filed 3 July 2006.

2. Claims 1-37 are pending in this application. Claims 1, 16, 31, 32 and 35 are
independent. In the Amendment filed 22 June 2006, claims 1, 31 and 35 have been
amended. This action is made Non-Final.

3. The rejections of claims 1-32 and 37 as being unpatentable over the article 
"Topic Detection and Tracking Pilot Study Final Report" by Allan et al (hereafter Final 
Report) in view of the article "Relevance Models for Topic Detection and Tracking" by 
Lavrenko et al have been withdrawn as necessitated by the arguments. The rejections 
of claims 33-34 as being anticipated by the article "Relevance Models for Topic 
Detection and Tracking" by Lavrenko et al have been withdrawn as necessitated by the 
amendment. The rejections of claims 35-36 as being anticipated by the article
"Relevance Models for Topic Detection and Tracking" by Lavrenko et al have been
maintained.

Claim Objections 

4. Claims 1 and 35 are objected to because of the following informalities: 
Claims 1 and 35 were amended; however, not all new information was
underlined.

Claim 35, line 5 recites, "a second source-identified associated." The claim is 
missing the word "story." 
Appropriate correction is required.

Claim Rejections - 35 USC § 101 

5. The 101 rejections have been withdrawn as necessitated by the amendment. 

Claim Rejections - 35 USC § 102 

6. The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that 
form the basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - 

(a) the invention was known or used by others in this country, or patented or described in a printed 
publication in this or a foreign country, before the invention thereof by the applicant for a patent. 

7. Claims 35-36 are rejected under 35 U.S.C. 102(a) as being anticipated by the 
article "Relevance Models for Topic Detection and Tracking" by Lavrenko et al 
(hereafter Relevance Model). 

Referring to claim 35, Relevance Model teaches a computer-implemented 
method of detecting new events comprising the steps of: 

determining a first source-identified story [story S1] associated with at least one 
event (Relevance Model: see section 3.2: Measuring Topic Similarity); 

determining a second source-identified associated with at least one event 
(Relevance Model: see section 3.2: Measuring Topic Similarity - story S2); 

determining a story-pair based on the first source-identified story and the second 
source-identified story (Relevance Model: see section 3.2: Measuring Topic Similarity, 
lines 1-14-S1 and S2); 

outputting an indicator of inter-story similarity between the first and second story
based on at least one of: 

an event frequency model, story segmentation (Relevance Model: see 
section 1: Introduction, lines 3-7) and a source-identified inter-story similarity 
metric. 

Referring to claim 36, Relevance Model discloses the method of claim 35, 
wherein story segmentation is based on at least one of: topic (see section 1 : 
Introduction, lines 3-7), an adjacent window and an overlapping window. 

Claim Rejections - 35 USC § 103

8. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

This application currently names joint inventors. In considering patentability of 
the claims under 35 U.S.C. 103(a), the examiner presumes that the subject matter of 
the various claims was commonly owned at the time any inventions covered therein 
were made absent any evidence to the contrary. Applicant is advised of the obligation 
under 37 CFR 1.56 to point out the inventor and invention dates of each claim that was
not commonly owned at the time a later invention was made in order for the examiner to
consider the applicability of 35 U.S.C. 103(c) and potential 35 U.S.C. 102(e), (f) or (g)
prior art under 35 U.S.C. 103(a). 

9. Claims 1-2, 7-17, 22-32 and 37 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over the article "Topic Detection and Tracking Pilot Study Final Report" by 
Allan et al (hereafter Final Report) in view of the article "Relevance Models for Topic 
Detection and Tracking" by Lavrenko et al. 

Referring to claim 1, Final Report discloses a method of detecting new events 
(see section 1.1: Background, lines 1-6) comprising the steps of:

determining at least one story characteristic based on at least one of: 

an average story similarity story and a same event-same source story
characteristic (see section 3.1: Detection Evaluation: lines 34-44 - the degree of
match between an event and a cluster; a cluster is considered to represent the
source);

determining a source-identified story corpus, each story associated with at least 
one event (see section 1.2: The Corpus; and section 2.1: Evaluation, lines 11-16 - the
corpus is the TDT study corpus (a source), which was developed by the TDT study effort
and comprises 16,000 stories from Reuters newswire and CNN broadcast news
transcripts);

determining a source-identified new story associated with at least one event (see 
section 3: New Event Detection, lines 19-24 - a Yes/No decision per story made at the 
time when the story arrives regarding whether or not the story is the first reference to a 
newly reported event); 

determining story-pairs based on the source-identified new-story and each story
in the source-identified story corpus (see section 3.1: Detection Evaluation, lines 6-48 -
each new story is associated with a cluster which is considered to represent a
story-pair);

determining at least one inter-story similarity metric [score] for the story-pairs 
(see section 3.1: Detection Evaluation, lines 30-33 - the score indicates how confident 
the system is that the story being processed discusses the cluster event); and 

outputting a new story event indicator [YES or NO decision] (see section 3.1: 
Detection Evaluation, lines 23-29) if the event associated with the new story is similar to 
the events associated with the source-identified story corpus based on the inter-story 
similarity metrics (see section 3.1: Detection Evaluation, lines 30-33 - score). 

However, Final Report fails to explicitly teach the limitation of determining at least 
one adjustment to the inter-story similarity metrics based on at least one story 
characteristic and then using the adjustment to output a new story indicator. Relevance 
Model also discloses a method for detecting new events similar to that of Final Report 
including the further limitation. In particular, Relevance Model discloses determining at 
least one adjustment to the inter-story similarity metrics based on at least one story 
characteristic (see section 4.3: Relevance Model Performance - the KL metric is 
adjusted by a clarity value) and using the adjustment to output a new event story 
indicator (see section 3.2: Measuring Topic Similarity). 

It would have been obvious to one of ordinary skill at the time the invention was 
made to utilize Relevance Model's method of determining at least one adjustment to the 
inter-story similarity metrics based on at least one story characteristic with Final
Report's method of detecting new events by using the KL metric and the score metric. 
One would have been motivated to do so since the Kullback-Leibler metric provides 
relevance modeling, which enhances the topic model estimate associated with a news 
story (Relevance Model: see abstract, lines 3-6). 
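
For illustration only, the Kullback-Leibler comparison relied upon above can be sketched as follows. This is a minimal sketch of the symmetric form D(M1||M2) + D(M2||M1) cited elsewhere in this action, not the implementation of either reference. The function names and the sample word distributions are hypothetical, and the sketch assumes both models have been smoothed so that every word has non-zero probability in each model. The clarity adjustment is not reproduced here.

```python
import math


def kl_divergence(p, q):
    """Return D(p || q) for two discrete distributions.

    p and q are dicts mapping a word to its probability. Assumption: q assigns
    non-zero probability to every word that p does (i.e. models are smoothed).
    """
    return sum(pw * math.log(pw / q[w]) for w, pw in p.items() if pw > 0)


def symmetric_kl(m1, m2):
    """Return D(m1 || m2) + D(m2 || m1); lower values mean more similar models."""
    return kl_divergence(m1, m2) + kl_divergence(m2, m1)


# Hypothetical topic models for two stories S1 and S2.
m1 = {"quake": 0.5, "rescue": 0.3, "market": 0.2}
m2 = {"quake": 0.4, "rescue": 0.4, "market": 0.2}
divergence = symmetric_kl(m1, m2)
```

A value of zero indicates identical models, and the value grows as the two word distributions move apart.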

Referring to claim 2, the combination of Final Report and Relevance Model 
(hereafter FinalReport/RelevanceModel) teaches the method of claim 1, wherein the
inter-story similarity metric is adjusted based on at least one of subtraction (Relevance 
Model: see section 4.3: Relevance Model Performance, lines 1-13 - the symmetric 
clarity-adjusted KL is the similarity metric utilized; and section 4.2: Value of Clarity- 
adjusted KL, line 12 - the equation uses subtraction) and division (Relevance Model: 
see section 3.2: Measuring Topic Similarity, lines 31-37 - clarity is considered to 
represent the adjustment). 

Referring to claim 7, FinalReport/RelevanceModel teaches the method of claim
1, wherein the inter-story similarity metrics are comprised of: at least one story
frequency model (Final Report: see section 4.1: Detection Experiments, lines 13-15);
and

at least one event frequency model combined using term weights
(Relevance Model: see section 5.1: Tracking algorithm, lines 1-23).

Referring to claim 8, FinalReport/RelevanceModel teaches the method of claim
1, wherein the inter-story similarity metrics are comprised of at least one story frequency
model (Final Report: see section 4.1: Detection Experiments, lines 13-15); and

at least one story characteristic frequency model combined using term weights
(Relevance Model: see Figure 6).

Referring to claim 9, FinalReport/RelevanceModel teaches the method of claim 
8, where the adjustments based on the story characteristics are applied to the term 
weights (Relevance Model: see section 1: Introduction, lines 29-31 and Figure 6). 

Referring to claim 10, FinalReport/RelevanceModel teaches the method of 
claim 8, where the adjustments based on the story characteristics are applied to the 
inter-story similarity metrics (Relevance Model: see section 4.3: Relevance Model 
Performance - the KL metric is adjusted by a clarity value). 

Referring to claim 11, FinalReport/RelevanceModel teaches the method of 
claim 1, wherein the inter-story similarity metrics are comprised of at least one term
frequency-inverse event frequency model (Final Report: see section 4.1: Event
Detection, lines 11-23) and where the events are classified based on at least one of:
story labels and a predictive model (Relevance Model: see section 5.1: Tracking 
Algorithm, lines 11-23). 

Referring to claim 12, FinalReport/RelevanceModel teaches the method of
claim 8, wherein an event frequency is determined based on term t and ROI category
rmax from the formula: $ef_{r_{max}}(t) = \max(ef(r, t))$ (Final Report: see section 3.2: Measuring
Topic Similarity - the equation finds the probability of the topic).

Referring to claim 13, FinalReport/RelevanceModel teaches the method of
claim 8, wherein an inverse event frequency is determined based on term t,
events e and rmax in the set of ROI categories from the formula: $IEF(t) = \log[\ldots]$
(Final Report: see section 3.5 Results, Analysis, and Future Work - finding the inverse
document frequency is comparable to finding the inverse event frequency).

Referring to claim 14, FinalReport/RelevanceModel teaches the method of
claim 8, wherein an inverse event frequency is determined based on term t, categories
e, r and rmax in the set of ROI categories and P(r), the probability of ROI r from the
formula: $IEF'(t) = \sum_{r} P(r) \log[\ldots]$ (Final Report: see section 3.5 Results, Analysis,
and Future Work - finding the inverse document frequency is comparable to finding the
inverse event frequency; the derivative of the first equation has been taken).

Referring to claim 15, FinalReport/RelevanceModel teaches the method of 
claim 1 further comprising the step of determining a subset of stories from the source- 
identified story corpus and the source-identified new story based on at least one story 
characteristic (Final Report: see section 3.1: Detection Evaluation - the stories within 
the corpus are placed in clusters which is considered to represent a subset). 

Referring to claim 16, the claim is rejected on the same grounds as claim 1. 

Referring to claim 17, the claim is rejected on the same grounds as claim 2. 

Referring to claim 22, the claim is rejected on the same grounds as claim 7. 

Referring to claim 23, the claim is rejected on the same grounds as claim 8. 

Referring to claim 24, the claim is rejected on the same grounds as claim 9. 

Referring to claim 25, the claim is rejected on the same grounds as claim 10. 

Referring to claim 26, the claim is rejected on the same grounds as claim 11.

Referring to claim 27, the claim is rejected on the same grounds as claim 12.

Referring to claim 28, the claim is rejected on the same grounds as claim 13. 

Referring to claim 29, the claim is rejected on the same grounds as claim 14. 

Referring to claim 30, the claim is rejected on the same grounds as claim 15. 

Referring to claim 31, the claim is rejected on the same grounds as claim 1. 

Referring to claim 32, the claim is rejected on the same grounds as claim 1. 

Referring to claim 33, FinalReport/RelevanceModel discloses the method of
claim 1, inter-story similarity information combined comprising the steps of:

determining P(sameROI(q,d)) based on the probability of story q and story d 
having the same ROI category (Relevance Model: see section 3.2: Measuring Topic 
Similarity - according to paragraph [0013], lines 6-7 of the applicant's specification, the
definition of an ROI category is a categorization of events; the topic is considered to 
represent an event); 

determining $similarity_{IEF''}$, based on a similarity with no inverse event frequency
influence (Relevance Model: see section 3.2: Measuring Topic Similarity); and

the formula: $similarity'(q,d) = P(sameROI(q,d)) * similarity_{IEF'}(q,d) + (1 - P(sameROI(q,d))) * similarity_{IEF''}(q,d)$
(Relevance Model: see section 3.2: Measuring
Topic Similarity, lines 15-17 - the formula is an extended version of D(M1||M2) +
D(M2||M1)).
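
For illustration only, the formula recited for claim 33 is a linear interpolation of two similarity scores, weighted by the probability that stories q and d share an ROI category. The sketch below assumes the three inputs have already been computed elsewhere. The function and parameter names are hypothetical and do not come from the claims or the cited references.

```python
def combined_similarity(p_same_roi, sim_ief_prime, sim_ief_double_prime):
    """Interpolate two similarity scores by the same-ROI probability.

    p_same_roi           -- P(sameROI(q, d)), expected to lie in [0, 1]
    sim_ief_prime        -- similarity computed with inverse event frequency influence
    sim_ief_double_prime -- similarity computed with no inverse event frequency influence
    """
    if not 0.0 <= p_same_roi <= 1.0:
        raise ValueError("p_same_roi must be a probability in [0, 1]")
    return (p_same_roi * sim_ief_prime
            + (1.0 - p_same_roi) * sim_ief_double_prime)
```

When the probability is 1 the result equals the first score, and when it is 0 the result equals the second; intermediate probabilities blend the two.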

Referring to claim 34, FinalReport/RelevanceModel teaches the method of
claim 33, wherein P(sameROI(q,d)) is based on the formula:

$P(sameROI(q,d)) = \frac{N_{same}(similarity_{IEF''}(q,d))}{N_{same}(similarity_{IEF''}(q,d)) + N_{different}(similarity_{IEF''}(q,d))}$

(Relevance Model: see section 3.2: Measuring Topic Similarity, lines 31-44 - clarity represents
Ndiff).

Referring to claim 37, FinalReport/RelevanceModel discloses the computer-
implemented method of claim 1, in which the new event indicator is displayed on at
least one of a visual, audio or tactile output device (Final Report: Section 3.6: Open
Issues).



10. Claims 3-5 and 18-20 are rejected under 35 U.S.C. 103(a) as being unpatentable 
over the article "Topic Detection and Tracking Pilot Study Final Report" by Allan et al in
view of the article "Relevance Models for Topic Detection and Tracking" by Lavrenko et 
al as applied respectively to claims 1 and 16 above, and further in view of US Patent No 
6,584,220 issued to Lantrip et al (hereafter Lantrip et al). 

Referring to claim 3, FinalReport/RelevanceModel teaches a method for new 
event detection as claimed in claim 1. FinalReport/RelevanceModel also teaches the
further limitation wherein the inter-story similarity metric is at least one of a probability
based inter-story similarity metric (see section 3.2: Measuring Topic Similarity, lines 1-14).
However, FinalReport/RelevanceModel does not explicitly teach the further
limitation wherein the inter-story similarity metric is also a Euclidean based inter-story 
similarity metric. Lantrip et al teaches a method of similarity metrics including a 
Euclidean based inter-story similarity metric (Lantrip et al: see column 4, lines 46-48). 

It would have been obvious to one of ordinary skill at the time the invention was 
made to utilize Lantrip et al's method of a Euclidean based inter-story metric with
FinalReport/RelevanceModel's method of a probability based inter-story metric. One would
have been motivated to do so since a Euclidean based inter-story metric aids in solving
the problem of how to train and cluster a corpus (Final Report: see section 1.2).

Referring to claim 4, the combination of FinalReport/RelevanceModel and
Lantrip et al (hereafter FinalReport/RelevanceModel/Lantrip) teaches the method of
claim 3, wherein the probability based inter-story similarity metric is at least one of a
Hellinger, a Tanimoto, a KL divergence (Relevance Model: see section 3.2: Measuring
Topic Similarity, lines 1-14 - Kullback-Leibler divergence) and a clarity distance based
metric.

Referring to claim 5, FinalReport/RelevanceModel/Lantrip teaches the method 
of claim 3, wherein the Euclidean based similarity metric is a cosine-distance based 
metric (Lantrip et al: see column 4, lines 46-48). 
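
For illustration only, a cosine-based comparison of two stories can be sketched as follows. This is a generic formulation and is not taken from Lantrip et al. The sparse term-weight vectors and the function name are hypothetical, and how the term weights are derived is outside the scope of the sketch.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two sparse term-weight vectors.

    u and v are dicts mapping a term to its weight. Returns 0.0 when either
    vector has zero length, since the angle is undefined in that case.
    """
    dot = sum(weight * v.get(term, 0.0) for term, weight in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


# Hypothetical term weights for two stories.
story_a = {"quake": 2.0, "rescue": 1.0}
story_b = {"quake": 1.0, "market": 3.0}
score = cosine_similarity(story_a, story_b)
```

A cosine distance, where one is wanted, is commonly taken as one minus this score.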

Referring to claim 18, the claim is rejected on the same grounds as claim 3. 

Referring to claim 19, the claim is rejected on the same grounds as claim 4. 

Referring to claim 20, the claim is rejected on the same grounds as claim 5. 

11. Claims 6 and 21 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
the article "Topic Detection and Tracking Pilot Study Final Report" by Allan et al in view 
of the article "Relevance Models for Topic Detection and Tracking" by Lavrenko et al as 
applied respectively to claims 1 and 16 above, and further in view of the article "On-line 
New Event Detection and Tracking" by Allan et al (hereafter New Event Detection). 

Referring to claim 6, FinalReport/RelevanceModel teaches a method of
detecting new events. However, FinalReport/RelevanceModel fails to explicitly teach the
further limitation wherein the inter-story metrics are determined based on a term
frequency-inverse story frequency model. New Event Detection discloses a
method similar to that of FinalReport/RelevanceModel including the further limitation.

In particular, New Event Detection discloses a method similar to that of claim 1,
wherein the inter-story similarity metrics are determined based on a term
frequency-inverse story frequency model (see section 4.1: Detection Experiments, lines 13-15).

It would have been obvious to one of ordinary skill at the time the invention was 
made to utilize New Event Detection's method of a term frequency-inverse story 
frequency with FinalReport/RelevanceModel's method of inter-story similarity
metrics. One would have been motivated to do so since all three articles discuss the
TDT initiative carried out by the Center for Intelligent Information Retrieval (Final Report: 
see abstract; Relevance Model: see abstract; New Event Detection: see abstract). 

Referring to claim 21, the claim is rejected on the same grounds as claim 6. 

Response to Arguments

12. Referring to applicants' remarks on page 12 regarding the Section 103 rejection
of claims 1, 16, 31, 32 and 35: Applicant argues that "The communication at page 7,
lines 4-6 asserts that Allan1 teaches 'a same-event-same source story characteristic
(see Section 3.1: Detection Evaluation: lines 34-44)'. However, this background portion
of the Allan1 reference describes the use of clusters. See example, lines 43-45
describing how 'the degree of match between an event and a cluster is defined to be
the number of stories that belong to both the event and the cluster.'"

The examiner respectfully disagrees. The cluster is considered to represent a 
group of events. The source is considered to be the TDT corpus. Therefore, the source 
is the same for all of the clusters, which means the clusters are organized by event and 
source. 

Applicants continue to argue that the reference fails to disclose determining a 
source-identified story corpus, each associated with at least one event. 

The examiner respectfully disagrees. The TDT is considered to be a
source-identified corpus regardless of the manner in which the stories are stored. TDT is
considered to represent the source.

Any inquiry concerning this communication or earlier communications from the
examiner should be directed to Kimberly Lovel whose telephone number is (571) 272-2750.
The examiner can normally be reached on 8:00 - 4:00.

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, John Cottingham can be reached on (571) 272-7079. The fax phone
number for the organization where this application or proceeding is assigned is
571-273-8300.

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a 
USPTO Customer Service Representative or access to the automated information 
system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 